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Abstract: This paper presents the design and test of a simple active near-infrared sparse 
detector imaging sensor. The prototype of the sensor is novel in that it can capture 
remarkable silhouettes or profiles of a wide-variety of moving objects, including humans, 
animals, and vehicles using a sparse detector array comprised of only sixteen sensing 
elements deployed in a vertical configuration. The prototype sensor was built to collect 
silhouettes for a variety of objects and to evaluate several algorithms for classifying the 
data obtained from the sensor into two classes: human versus non-human. Initial tests show 
that the classification of individually sensed objects into two classes can be achieved with 
accuracy greater than ninety-nine percent (99%) with a subset of the sixteen detectors using 
a representative dataset consisting of 512 signatures. The prototype also includes a Web- 
service interface such that the sensor can be tasked in a network-centric environment. The 
sensor appears to be a low-cost alternative to traditional, high-resolution focal plane array 
imaging sensors for some applications. After a power optimization study, appropriate 
packaging, and testing with more extensive datasets, the sensor may be a good candidate 
for deployment in vast geographic regions for a myriad of intelligent electronic fence and 
persistent surveillance applications, including perimeter security scenarios. 

Keywords: Electronic fence, imaging sensor, sparse detector array, object identification. 
Web-service interface 
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1. Introduction 

By definition, a sparse detector sensor is an imaging device that has a relatively sparse detector array 
as compared to state-of-the-art imaging sensors. Sparse detector sensors may be a low-cost alternative 
to traditional, high-resolution imaging sensors, which use dense focal plane arrays, for object 
classification. Size, cost, and power restrictions preclude the use of traditional imagers in applications 
that require widespread deployment of the sensors and in scenarios in which the sensors must be 
regarded as disposable. Classification of sparse detector sensor data is of particular interest when 
building inexpensive, unattended ground sensors; however, robust classification is challenging due to 
the paucity of information that can be used to detect and identify various objects. 

The design of an unattended ground sparse detector imaging sensor for broad-scale object 
classification has been prototyped in our laboratory [1-3] with support from the U. S. Army Research 
Laboratory (ARL). This prototype sensor is being designed and evaluated, in part, to address the need 
to monitor trails and unimproved roads, which provide routes for drug smuggling traffic, as well as 
other applications in which broad-scale classification of sensed objects is of high interest [4-5]. 
Unattended ground sensors that can reliably distinguish between humans and animals are critically 
needed for several other potential military, homeland security, and commercial applications. 

This work complements ongoing research at the Center for Advanced Sensors at the University of 
Memphis to develop a network of low-cost sensors and intelligent signal processing algorithms that 
detect and provide a broad-scale classification of humans and vehicles, while ignoring non-utility 
animals. Since trails and unimproved roads are often the point of entry for illegal aliens, smugglers, 
and terrorists, effectively monitoring these areas represents a significant challenge to national security. 
The United States' border with Canada and Mexico is approximately 12,000 km, with many remote 
and uninhabited sections. Persistent monitoring of these areas is currently not feasible and is restricted 
to high-traffic areas; however, ubiquitous deployment of sensors to monitor the entire border is of 
extreme interest. For example. The Department of Homeland Security (DHS) is leading the 
development of technology for border security via the SBI.net project [6]. This project is attempting to 
use advanced sensors, such as moving target indication (MTI) radar and thermal infrared cameras, in a 
network-centric environment, to perform detection, classification, and tracking of humans crossing the 
border. When fully developed, the estimated cost to deploy this technology along the entire U.S. 
border is approximately $620K per km $1 million dollars per mile of border) [7]. 

Innovative use of alternative low-cost sensors, including imaging sensors, and networking 
technology may provide a competing or complementary monitoring capability for borders and other 
application scenarios. For example, suppose a suite of low-cost sensors could be developed that 
provide a significant percentage of the functionalities offered by the technologies selected for the 
SBI.net project, but they could only do so for a coverage area of 10 m. If the cost of each sensor was 
$100 and it costs an equal amount for installation, the total cost would be approximately $20K per km 
of border. If each sensor had a 10-percent (10%) per year failure rate, there would be an additional 
recurring cost of $2K per km. Even doubling these costs, there is still a substantial margin between the 
potential costs of the sensors envisioned as part of our work and the technologies currently under 
development for comprehensive electronic border security. 
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The remainder of this paper details the design and evaluation of one approach to a low-cost imaging 
sensor that has a sparse detector array. It is proposed that such a sensor could be a component of a 
ubiquitous, low-cost sensor network. Section 2 introduces our sparse detector imaging sensor 
prototype and an initial motivating application and provides the details of the sensor developed in our 
laboratory, including the acquisition of images used for subsequent classification. Section 2 also 
provides details about the Web-services interface and issues for future deployment. Section 3 
highlights an initial approach used for classifying the images from the sensor into human and non- 
human classes. Section 4 offers conclusions and future directions. 

2. Sparse Detector Imaging Sensor 

2.1. Motivating application 

Typically, smugglers on foot use large packs to transport contraband weighing up to 50 kg along 
known trails and unimproved roads across the border between the U.S. and Mexico. Smugglers often 
travel in large groups and the border patrol has inadequate personnel to monitor these vast geographic 
regions. Therefore, a high degree of confidence in classification algorithms is needed to provide 
notification when objects of interest are detected to allow authorities to assemble adequate personnel to 
intercept and apprehend the smugglers along known points on the trails or roads. Moreover, numerous 
and inexpensive unattended ground sensors are needed for placement at several locations and these 
sensors should be resilient to false alarms as there is inadequate capacity among the authorities for 
reacting to false alarms. In our initial application domain, such trails and unimproved roads are 
abundant, but most of the routes are known. However, the routes have many travelers that are not of 
interest, including non-utility animals and humans that do not fit the profile of interest. In typical 
deployments, a sensor would be placed near a trail having a width of approximately 1 to 1.5 m or near 
an unimproved road of width of approximately 3 to 5 m. These sensors can be located where 
vegetation or other environmental features can be used to hide placement. The trails and roads are 
located in areas that are considered open range. Wild horses, cattle, deer, large cats, dogs, rabbits, and 
pigs are just a few of the non-utility animals that use the same trails as humans moving through the 
area. Figure 1 illustrates a variety of object types for which object identification is of high priority. 

2.2. Sparse Detector Model 

Our prototype sparse detector imaging sensor system can be viewed as a near-infrared 
implementation of the model described by our colleagues at the Center for Advanced Sensors as shown 
in Figure 2. As modeled in Robinson et al. [8], the sparse detector system consists of an array of 
sensing elements deployed in a vertical configuration on a transmitting/receiving platform. The sensor 
system also has a reflecting platform that is placed at a distance represented by widthsys. Each sensing 
element has a detector and a dedicated collecting lens component. Each sensing element is arranged 
such that its optical axis is perpendicular to the plane of the vertical array comprised of N sensing 
elements that are placed at a distance of dpuch apart. The range to the object of interest along the optical 
axis is represented by R. The field of view (FOV) of each sensing element is calculated as a function of 
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the detector area and optics of each sensing element. The overall height of the sensor system is 
represented by heightsys. 



Figure 1. (a) Human with large backpack, (b) Utility animal with packs, (c) Vehicles, such 
as SUVs. 






(c) 




Robinson et al. [8] have developed and used this model for rough trade-off analyses, which include 
the effects of the optics, atmosphere, detectors, object-of-interest characteristics, and system 
characteristics, such as detector pitch and normalized detectivity. As further described in [8], the 
sensor can be classified as either having a staring or a scanned system type [9]. Since it consists of a 
stationary sparse detector array with no device for scanning the image across the detectors it could be 
viewed as a staring system in which the vertical resolution is dependent upon the deployment of the 
individual sensing elements, which comprise the overall device. However, it can also be regarding as a 
scanning system since the image will effectively be 'scanned' across the detectors by the object-of- 
interest' s motion in which the horizontal sample rate of the system will be determined by the 
integration time of the detector. 
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2.3. Sensing elements 



The CX-RVMS retro-reflective infrared sensor [10], as shown in Figure 3, was used as the principal 
sensing element to construct the prototype sensor system to obtain signature data in a laboratory and 
controUed-field environment for further analysis. The CX-RVMS was selected because of its 
suitability for both laboratory and controUed-field testing. It has environmental reliability, which 
includes BSi IP67 waterproof construction, and it is vibration resistant with its interior fully filled with 
resin. Moreover, this sensing element has a 1 ms or less response time, 5 m sensing range, and can 
operate from -25 to 60 °C. An alternative, lower-power sensing element would be required for wide- 
scale deployment; however, the CX-RVMS served our purposes for proof of concept and to acquire 
initial signature data. 



Figure 3. CX-RVMS retro-reflective photoelectric sensing element (all units mm) [10]. 

-25- 

(1.9)3 ~^ 



16.5 1 



, Beam-receiving i 1 
^ part J=t 



EE, 




Beam-emitting 
part 



2-IV13 X 0.5 thru-hole thriBads 




^3.7 cable 2m long 



2.4 Sensor system assembly 

The prototype sparse detector sensor was assembled by placing 16 CX-RVMS retro-reflective 
sensing elements at 12.7-cm intervals in a vertical conflguration (A^=16, dpuch = 12 J cm, and heightsys « 
2.2 m with respect to the model in Figure 2). Each sensing element can be regarded as an optical trip 
wire and was attached to a supporting platform with reflectors mounted on an opposing platform to 
provide the break-beam curtain. Each CX-RVMS sensing element was interfaced to a USB-DIO-32 
digital input board using a simple voltage divider breadboard circuit to provide the required 0 to 5V 
output. The input board was subsequently connected to a host computer via a USB interface. Figure 
4(a) illustrates the conflguration in which sensing element BO (the flrst sensing element) is placed at 
17.7 cm from the platform base and sensing element C7 (the sixteenth sensing element) is placed 
approximately 208.2 cm from the platform base. Figure 4(b) is a photograph of the 
transmitting/receiving platform with the sensing elements interfaced to a host computer in the 
laboratory environment and Figure 4(c) illustrates a human walking between the platforms with 
widthsys ^ 1.2 m. 
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Figure 4. (a) Prototype sensor logic diagram, (b) Transmitting/receiving platform, (c) 




(a) (b) (c) 



2.5. Data acquisition 

The driver software for the digital input board interfaced to the sensor uses a 32-bit DLL compatible 
with any Windows programming language. A C/C++ program was developed to acquire signature data 
assuming at least one sensing element's beam would be broken as an object traverses through the 
sensor's optical curtain for initial testing purposes. Therefore, data acquisition started when a break- 
beam event first occurred and continued at a sampling rate of approximately 10 ms until none of the 
sensing elements detected a break in their beam. 



Figure 5. (a) Human with backpack, (b) Break-beam pattern from the sensor for a human 
wearing a backpack, (c) Silhouette generated from the break-beam pattern acquired by 
the sensor. 




(a) (b) (c) 

A raw dataset is created by the C/C++ program by writing a single ASCII file for each object 
detection event as a string of Is and Os corresponding to no-break and break, respectively, for sensing 
element BO through C7 as the object passes through the sensor's optical curtain. Each of the 16 sensing 
elements is sampled in parallel, which provides data for a ' 16 x /' matrix. Variable is the number of 
samples taken for each sensing element and will be constant within a single file; that is, the same 
number of samples will be taken for sensing elements BO through C7 for a given object's data 
acquisition. However, will vary among files as it depends on the specific object and the interval of 
time in which at least one sensing element detected a break-beam event, that is, depends on the 
speed, size, configuration, and other variables of the object. Figure 5(b) is a visualization of the optical 
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trip wires for a typical break-beam pattern for a human with a small backpack and was produced using 
the Visualization Toolkit (VTK) [11]. 

Figure 6. Silhouettes created from break-beam patterns acquired from the sparse detector 
imaging sensor: (a) human with large backpack, (b) human without backpack, (c) two 
humans with large backpacks, (d) two humans without backpacks, (e) donkey, (f) llama, 
(g) horse with human rider, (h) horse with pack led by human, (i) sport-utility vehicle 
(SUV), (j) pick-up truck, (k) van. (1) car. The x-axis denotes the number of samples and the 
y-axis denotes a sensing element number (1-16). 




(e) (f) (g) (h) 




(i) G) (k) (1) 

Figure 5(c) illustrates a representative 'crude image' or silhouette that was created from the break- 
beam pattern acquired from the prototype sensor using a MATLAB program. For the testing and 
evaluation of the sensor and the human versus non-human classification task using silhouettes 
described later in this paper, a total of 512 datasets were acquired from the sensor, consisting of 137 
animals, 226 humans, and 148 vehicle-type signatures. Figure 6 presents representative silhouettes for 
a variety of objects that were acquired from the sensor, including: (a) human with large backpack; (b) 
human without backpack; (c) two humans with large backpacks; (d) two humans without backpacks; 
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(e) donkey; (f) llama; (g) horse with human rider; (h) horse with backpack led by human; (i) sport- 
utility vehicle (SUV); (j) pickup truck; (k) van; and a (1) car. 

For each image in Figure 6, the x-axis lists the number of samples taken to generate the silhouette 
and the y-axis lists a sensing element number (1-16). Note that the sensing element order on the y-axis 
of each image is reversed from the order used to configure the sensor in Figure 2. The maximum value 
of the X-axis provides a relative indicator of the speed and the width of the object or objects in the 
silhouette since wide objects would break the optical curtain for a longer time than narrow objects. The 
X-axis maximum value can also be used to compare the relative speed of similar object types, that is, a 
human running would result in fewer samples acquired by the sensor as compared to a similarly sized 
human walking through the optical curtain. 

These images or silhouettes are remarkable given that they were produced from a sensor with only 
16 detectors placed at 12.7-cm intervals. The data was collected with the sensor configured with 
widthsys ranging from approximately 1 to 5 m (that is, the distance between the transmitting/receiving 
and reflecting platforms). In the case of multiple objects traversing between the sensor's 
transmitting/receiving and reflecting platforms, as in Figures 6(c), (d), (g) and (h), our data acquisition 
program required a continuous break of at least one optical trip wire (sensing element) to acquire the 
data for both objects in one dataset file. Multiple objects were not used as training datasets for the 
initial classification algorithms reported later in this paper. A number of samples were acquired for 
each object type as appropriate, for example, different strides, postures, orientations, speed, etc. 

2. 6. Service-oriented software interface 

In conjunction with the silhouette collection and classification algorithm development, we have 
been developing software for the prototype profiling sensor to facilitate integration within a network- 
centric environment using service-oriented computing [12] infrastructure and Web services. Network 
access to the sensor has also been developed and has been exposed via Web Services Description 
Language (WSDL) so that client applications can be designed for the profiling sensor using the 
development environment of the programmer's choice. All sensor data and embedded operations, such 
as self-test, sensor sample rate, alarm thresholds, and other configuration parameters and functionality 
are being exposed as Web services using WSDL [3]. Such a service-oriented architecture with Internet 
Protocol (IP) networking provides a framework for including the sensor in a variety of intelligent 
monitoring applications, deployment, and interoperability within existing frameworks. For example, an 
application may need to task a given deployment of the profiling sensor to detect any object that it 
senses. This could be accomplished via the detect_object service call, which requires the client to 
specify the time interval for which the sensor should report if at least one optical beam was broken. 

More sophisticated services, such as detect _object_type, require the client to specify the object type 
to detect (for example, human, human_large_backpack, SUV, horse, generic object, etc.), the threshold 
(e.g., an integer specifying the number of occurrences of the specified object to detect during the 
specified time interval), the time interval for monitoring for the specified object, and an e-mail address 
to notify the recipient of the detection or identification event that satisfies the specified constraints. 
Example Web services for the profiling sensor are represented in Figure 7 using the Unified Modeling 
Language (UML). Note that the majority of the service calls require the sensor to have completed a 
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successful self test to ensure the sensor is aligned and operating correctly, which is represented as a 
UML pre-condition constraint in Figure 7, before the client can successful invoke the other services. 



Figure 7. Example Web services for the profiling sensor described with UML. 



«precondition» 
self.get_sensor_status==1 



Profiling Sensor 



get_sensor_status:int 
.^detect_object(Timelnterval):boolean 

■^detect_object_type(Type, Threshold, Monitoringlnterval, E-mail):boolean 
get_meta_data:Object 
^get_sensor_data(Timelnterval):Object 
'^et_last_reading:Object 



Currently, the prototype version of the sensor provides a WSDL file that serves as a wrapper for a 
Java program that implements the Web service and subsequently invokes the lower-level sensor API 
developed in C/C++ as shown in Figure 8. The WDSL and the associated Extensible Markup 
Language (XML) schema describe all types, methods, arguments, and responses of the sensor. The 
client maps the abstracted types and structures specified by the WSDL file to the specific bindings 
required by the client's host programming language. The client communicates with the sensor via the 
Java Web service using Simple Object Access Protocol (SOAP) calls, which are exchanges of XML- 
based messages over a network. Using this approach, the developer can use the software 
implementation platform of their choice when using the profile sensor in custom applications without 
the burden of knowing the lower-level sensor application programmer's interface (API) or device 
drivers, which is typically required when using sensors and other devices in custom applications. 



Figure 8. Profiling sensor service-oriented interface to client. 
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Light-weight versions of the profiling sensor are planned that are bundled with an embedded 
processor. Such devices will have a very limited computational platform powered via batteries, 
wireless interface, and an environmentally hardened construction. These devices will be evaluated to 
determine the level of computational complexity supported, including the feasibility of hosting a full- 
featured. Web-services interface as is provided by the initial prototype. 

To accommodate ubiquitous deployment, data communication protocols must be carefully 
considered. Minimizing wireless communications saves both bandwidth and power. The break-beam 
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patterns of each sensor are highly compressible, so one approach is to transmit the collected data over 
a wireless communication channel to a base station each time the sensor is activated as an alternative 
to designing the sensor to include an embedded processor to locally classify the data and to process 
various tasking parameters. 

Power requirements are also a major concern in the design of the sensor, as long battery life is 
highly desirable. The choice of detectors affects power consumption along with the choice of data 
communication protocols. Transmission of high-level image classifications consumes less power than 
the transmission of the raw data, even if it is compressed. Naturally, the decision regarding the 
centralized base station approach versus the distributed architecture, with the intelligence on the 
sensor, will be determined by the specific application. 

2. 7 Other deployment issues 

Concealment of the device is also a requirement in many applications. To be effective, a smuggler 
or person crossing a border illegally must not be aware of the location of the sensors. Figure 9 
demonstrates a simple means of camouflaging a notional packaging of a profiling sensor using paint. 
Another means of hiding the device that has been proposed is to distribute the sensing elements 
horizontally along a path. Each beam-break device would operate at both a different height and a 
different distance along the path. By distributing the sensing elements, the profiling sensor system is 
less conspicuous. By keeping track of the location of each sensing element, the profile can be 
constructed by synchronizing the data from the distributed beam-break sensors. The distributed 
approach requires additional computational overhead for coordination, which may be processed either 
locally or at a centralized base station. 

Figure 9. Example camouflage of a sparse detector sensor: (a) Proflling sensor realized 
with sensing elements in PVC pipe and transmitting/receiving platform mounted on a tree, 
(b) Profiling sensor realized with sensing elements in PVC pipe with transmitting/receiving 
platform painted to blend in with a tree [1]. 




(a) (b) 
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While the sensing elements described in this paper are an active near-infrared emitter/detector, it 
would be possible to construct a profiling sensor from passive infrared detectors. Klett et al. [5] 
describe the analysis of optical and radiometric calculations that are necessary to begin evaluating a 
passive infrared sparse sensor system. These types of sensors are often used in motion detectors to 
activate outdoor lighting and domestic security systems. Lenses could be designed to provide a 
sufficiently narrow field of view to capture one element of a profile. This type of detector has the 
advantage of not requiring an active light source or a refiector to operate since the radiation originates 
from the object-of-interest. In addition, a long wave infrared (LWIR) version, in which the majority of 
the sensed energy is emissive rather than refiective, may minimize false alarms, which are common 
with commercial passive infrared detectors [4]. Also, a detailed power analysis and techniques to 
decrease the power consumption of sparse detector sensors and their associated communications 
package is an additional opportunity for research. 



3. Classification Algorithms 



Five algorithms were tested to classify data obtained from the profiling sensor into two classes: 
human and non-human. The algorithms were the Naive Bayesian Classifier (NB) [13], Naive Bayesian 
with Linear Discriminant Analysis for dimensionality reduction (NB + LDA) [14-15], AT-Nearest 
Neighbor classifier (AT-NN) [16], Soft Linear Vector Quantization (SLVQ) [17], and Support Vector 
Machines (SVM) [18]. Although we report profiling sensor classification results in prior work [1-2], 
those results were not for the two-class problem, nor were those algorithms tested with the extensive 
512 datasets described in Section 2.5. For completeness, a review of each algorithm is discussed before 
presenting the latest classification results. 



3.1 . Naive Bayesian classifier 



A Naive Bayesian classifier assigns a test sample to a class with the highest posterior probability 
among the J classes, with the assumption that each feature of the sample is statistically independent 
[13]. The posterior probability of the jth class, P(coj\x), is the probability of a class given the data 
sample (vector) x = [ xi , X2 xm ], where Xi is the /th feature of the data vector. Bayes' theorem 
relates the posterior probability to prior probability of the yth class, P(coj); class conditional probability, 
P(x\cOj); and the total probability of x or evidence, P(x) as shown in (1). 

P(co.)Pix\co.) (I) 

P(x) 

The prior probability P(coj) is given by p{cifj) = ^ , where nj is the number of data samples belonging 
to the jth class and is the total number of samples. The total probability of x, P(x), is shown in (2): 

P(x) = f^P(C0j)P(x\C0j) (2) 

7=1 

If each class is expected to occur with equal probability, then classification based on posterior 
probability depends only on the class conditional probability. The calculation of the class conditional 
probability is commonly performed using the parametric approach, where a probability distribution 
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model, such as the Gaussian distribution, is assumed for the data points. The parameters for the 
distribution (e.g., mean and standard deviation for Gaussian) are then estimated from the training 
samples. Thus, the class conditional probability of the yth class given a data vector can be expressed as 
shown in (3): 

P{X \(Oj) = I (Oj)P{x^ I 0}j)...Pix^ \(Oj) = 

(3) 



exp 



•^1 - M J- 



exp 



^2 - Ml J 



2J ) 



exp 



MJ 



J J 



In Equation (3), represent the estimated mean and standard deviation of Ath feature 

corresponding to the yth class. With the assumption of equal prior probability for each class for a test 
sample vector xtest, if P{xtest\(j)j) has the highest value among all class conditional probabilities, then 
the test sample is assigned to the yth class. 

3.2 Naive Bayesian classifier with Linear Discriminant Analysis for dimensionality reduction 



Dimensionality reduction involves projecting a variable from an M-dimensional to an iS-dimensional 
space, such that M> S. Linear Discriminant Analysis (LDA) is a dimensionality reduction technique in 
which the data is projected onto a lower dimension so that the overlap between the classes to be 
discriminated is minimized making it popular as a preprocessing technique for the Naive Bayesian 
approach. In LDA, each axis in the new space is a linear combination of all the axes in the original 
space. In matrix notation, the LDA transformation can be expressed as in (4): 

Y = W^X (4) 

In Equation (4), Y is the new space with S dimensions, X is the original space with M dimensions, 
and W is the transformation matrix. The LDA algorithm finds a transformation matrix that maximizes 
the Fisher criterion [14] as a function of fTand is shown in (5): 

(5) 



F{W) = 



W'S^W 



For C number of classes, the between class scatter matrix [15], Sb, is shown in (6), in which Pi and 

c 

Mi are the probability of occurrence and mean of the /th class, while = ^P^M^ the within class 

i=i 

scatter matrix [15], Sw, is shown in (7), in which is the covariance matrix of the zth class: 

1=1 

Also, Sb,y=W^Sb,xW and Sw,y= W^Sw,xW, in which Sbj and SwY are the between and within class 

scatter matrices of 7, while Sb,x and Sw,x are the between and within class scatter matrices of X, The 

s 

matrix Wthat maximizes F is the matrix of eigenvectors of — . 

To project the data onto the lower-dimensional space with iS-dimensions, the eigenvectors 
corresponding to the S most significant eigenvalues are used for the transformation. Once the data is 
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projected onto this lower dimensional space, the S features are then used by the Naive Bayesian 
classifier for classification of the data. 

3.3. K-Nearest Neighbor classifier 

The AT Nearest Neighbor classifier assigns a test sample to the yth class if a majority of its K nearest 
neighbors, which are from the training data, belong to the yth class [16]. The training and testing 
samples are defined in an M-multi-dimensional space. The neighborhood is defined using a distance 
measure, such as the Euclidean distance, in which the distance between a test sample, xtest, and any 
training sample x is given by (8): 

(8) 

dist(xtest,x) = l^(xtest. - x.)^ 

The dist(xtest,x) is computed between xtest and every stored data sample (training data). The 
training samples are then sorted in ascending order according to the distances from the test sample. 
The first K training samples are then picked and the test sample is assigned to the class to which most 
of the K samples belong. 

3.4. Soft Learning Vector Quantization 

Learning Vector Quantization is a technique to learn prototypes of various classes in Nearest 
Prototype Classifiers (NPC). Unlike the AT-NN classifier, in which the distance between the test sample 
and all training samples is computed, with NPCs, the distance between the test sample, xtest, and 
prototype vectors for each class is computed. The test sample is assigned to a class whose prototype 
has the least distance from the test sample. Prototypes represent each class in an M-dimensional space. 
One or more prototypes can be used to represent a given class to help accommodate variations within a 
class. If the set {^/, coi) represents the prototype vectors and the corresponding classes, 1,2, ... , A/, 
in which N is the number of prototypes, then, coi is the class of the /th prototype with a>i taking values 
from 1, 2, ... , C, in which C is the number of classes. The NPC approach finds p as shown in (9) in 
which distO is a distance measure such as the Euclidean distance. The test sample is then classified to 
class (Op. 

p = arg mm(dist(xtest, 0. )) (9) 

i 

There are many versions of LVQ, such as LVQl, LVQ2.1, LVQ3, OLVQl, and 0LVQ3 [19]. Each 
version differs in the way the prototypes are updated in the learning process. For example, in version 
LVQ2.1, first, the prototypes for the various classes are initialized. In each iteration, for a training data 
point, two prototypes, Oa, Ob, are picked based on the Euclidean distance from the training data point. 
The two prototypes are not updated if both belong to the same class as the training data point. If 6a 
belongs to the same class as the training sample, and 6b does not, then the prototypes are updated as 
shown in (10): 

^,(^ + 1) = ^,(0 + ^(0(^-^.(0) (10) 
^^(^ + 1) = ^^(0-^(0(^-^^(0) 
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Another condition on the update is that the test point x should lie close to classification boundary; 

therefore, in the previous case, > jh , where Th is a threshold. 

dist(x, 0^ ) 

Seo et al. [17] introduced the Soft Learning Vector Quantization (SLVQ) technique in which the 
prototypes are represented by Gaussian mixtures. The prototype representing the yth class is 
represented by the yth component of the Gaussian mixtures. In other words, it is assumed that a 
particular data point belonging to the yth class is generated by the yth component of the Gaussian 
mixture. While LVQ produce hard decisions, SLVQ generates soft decisions. SLVQ provides degrees 
of membership for a test sample with respect to the various classes. Since the prototypes are 
components of a Gaussian mixture, the jth prototype is described by its parameters: mean and standard 
deviation, 9j = {ju,(j}. Let S = {x,y} represent the set of all training data points, x, and corresponding 
class labels, y, while T = {9, co} represents the set of prototypes, 6, and the classes they represent, co. 
The prototypes 9 are calculated using gradient descent while optimizing the cost function in (1 1): 

N 

Y,p{x,y\T) 

logi = ^^^ <") 

Y,p{x.y\T) + p{x,y\T) 

In Equation (11), p(x,y \ T) is the probability of a data point x being generated by a prototype of the 
correct class and p{x,y\T)i^ the probability of the data point being generated by a prototype of the 
incorrect class. 

3.5. Support Vector Machines 

Support Vector Machines (SVMs) are classifiers that address the two-class problem by identifying a 
separating hyper-plane that leaves the maximum margin from each of the two classes [18]. The hyper- 
plane is defined by h(w) = w^x+wo, in which w and wo characterize the direction and position of the 
hyper-plane, respectively. Since neither of the classes is to be given preference, the hyper-plane should 
be located equidistant from the nearest points from class 1 and class 2. The distance of any point x 
from the hyper-plane is given by (12). Further, w and wo can be scaled so that the distance of the 
hyper-plane from the nearest points in class 1 and class 2 is set to unity. The margin is then given by 
(13). Further, if x belongs to class 1, then w^x+wo > 1 and if x belongs to class 2, then w^x+wo < 1 . 

(12) 

llwll 



1 



llwll llwll llwll 



2 

The goal is to maximize the margin -rr—rr between the nearest points of class 1 and class 2 when the 

w 

classes are separable. SVM finds w and wo by minimizing the cost function (14) under the constraints: 
y^iw^Xi + wo)> 1 where yi = I if the training data point x/ belongs to class 1, and j^/ = 1 if the training data 

point Xi belongs to class 2. 

Ill ll2 (14) 



J(w) = —\\w\ 
2" ' 
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In the case when the two classes are not separable, the optimization is more complicated, and SVM 
finds w, Wo, and by minimizing the cost function [18] shown in (15) under the constraints 
{w^x^ + ) > 1 - , where i=\,2, ,N in which N is the number of training samples and > 0. 

/(w)=i|Hr+ct^,. (15) 

S.6. Classification of the prototype profiling sensor data 

The majority of the algorithms were implemented using MATLAB and the data was classified off 
line. The SLVQ technique was implemented using the RSLVQ toolbox [17]. However, the low-level 
profiling sensor software developed in C/C++ has been designed to easily accommodate the 'plug-and- 
play' of a variety of algorithms to support real-time classification. As described in detail in Section 2.5, 
as an object passes through the sensor's optical curtain, a 16 x / matrix is generated. Each row 
corresponds to the output of a sensing element. For each of the 16 sensing elements, when the break- 
beam event occurs, the samples are recorded as Os; otherwise, the samples are recorded as Is. The 
number of Os for each sensing element was used as features. Thus, for each object, 16 features were 
measured. Furthermore, to make the features independent of the speed of an object, each feature was 
normalized between 0 and 1 by dividing each feature value by the feature with the highest value for 
that object. The overall goal was to classify an object as human or non-human. Vehicles and animals 
formed the non-human class, but if these two classes were grouped into a single class, the variance of 
the non-human class would become very large. To overcome this problem, class 1 represented humans, 
class 2 represented animals, and class 3 represented vehicles. An object was classified as non-human if 
it belonged to class 2 or class 3. Since the SVM technique is devised for two-class problems, when 
SVM is used as a classifier, the one-against-one strategy with majority voting is used to address the 
three-class problem. Table 1 summarizes the classification rate (CR) for the data using the five 
classification algorithms previously described. 



Table 1. CR(%) for human and non-human objects using 16 sensing elements. 



NB 


NB+LDA 




SLVQ 


SVM 


97.4 


96.7 


96.5 


95.1 


99.2 



The classification results indicate that SVM performed the best using the sample datasets, closely 
followed by NB, NB + LDA, ^-NN, and SLVQ, respectively. Errors in the classification rate are also 
influenced by the feature extraction process. The feature extraction process used here did not account 
for the details of the structures in the silhouette. For example, for animals, the fact that some have four 
legs and that they are in pairs, with one pair in the front and one pair in the rear of the animal was not 
accounted for by the feature extraction process. This becomes a critical issue in differentiating a 
human carrying a large backpack from a medium-sized animal, especially when each object is captured 
in a single silhouette. Future research in the area of profiling sensor algorithm development will 
include adding rules in the feature extraction process to accommodate structural information in 
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silhouettes and the capability to identify and classify multiple objects of the same or different classes 
captured in the same image as in Figures 6(c), (d), (g) and (h). 

3. 7. Impact of sensing element configuration on the classification rate 

After the baseline classification rate (CR) for human versus non-humans was established using the 
sensor assembly described in Section 2.4 for the algorithms described in Sections 3.1 through 3.5, 
further analysis was performed to establish additional baseline results. These results document how the 
number of sensing elements and their arrangement impacted the CR with respect to the human versus 
non-human classification task using the 512 datasets previously described. 

The CR was determined for all five algorithms using every subset of the set containing the 16 
sensing elements s= {1,2 ... 16} configured as described in Section 2.4. The highest CR obtained for 
each subset of sensing elements was recorded. The results reported in Figure 10 and Table 2 are for the 
SVM algorithm. The combination of sensing elements that provided the highest classification rate for a 
particular subset of s is shown in Table 2. 

Figure 10. Highest CR(%) for the human versus non-human classification task obtained 
using the SVM classification algorithm for the subsets containing 1 through 16 sensing 
elements. 
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The results indicate that when using the SVM algorithm on the 512 datasets, CRs over 90% for the 
human versus non-human classification task can be achieved using only two sensing elements (element 
4 and 11, with respect to Figure 2). The best CR of 99.7% was achieved using 10 sensing elements. 
The improved performance using 10 sensing elements rather than 16 can be explained as follows. 
There were 16 sensing elements in the nominal sensor system, each of which can be thought of as a 
dimension in a 16-dimensional space. When an object passes through the optical curtain, a data point 
in this multi-dimensional space is generated. Classification algorithms, such as SVM, classify data 
points in multi-dimensional space by establishing boundaries between classes. Dimensions 
corresponding to certain sensing elements can be irrelevant for differentiating between classes and in 
fact may be responsible for increased overlap between data points of different classes. Such a scenario 
results in a decrease in class separability and increases classification errors. Thus, when outputs from 
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such sensing elements are ignored in the process of classification, the CR can be increased, depending 
upon the specific classification task. Also, it was observed that the highest CRs for each subset of s 
was obtained by the SVM algorithm except for the case of a subset containing only 3 elements, in 
which the NB technique performed the best with a CR of 96.7%, using sensing elements 1, 4, and 11. 

It must be emphasized that this analysis was performed to establish baseline results using a nominal 
sparse detector configuration with a representative dataset containing 512 object signatures of humans, 
animals, and vehicles. Testing with additional algorithms, including the fusion of multiple algorithms, 
is warranted using more extensive and varied datasets, as well as testing modifications to the profiling 
sensor system assembly parameters, such as dpuch or sampling rate. Lastly, additional analysis of other 
classification tasks, such as human with no backpack versus human with a backpack, SUV versus car, 
horse versus deer, etc., is needed to provide a more comprehensive study of the classification rate as a 
function of the number of detectors, their arrangement, and the specific classification task. 

Table 2. Sensing element combination yielding the highest CR (SVM algorithm). 



Number of sensing elements in Sensing elements in the subset of s 
the subset of s (cardinality of s) (order as in Figure 2) 



1 


13 


79.4 


2 


4,11 


94.7 


3 


4, 10-11 


96.4 


4 


4, 10-12 


98.1 


5 


4, 5, 10-12 


98.7 


6 


1,4, 10-12, 15 


99.0 


7 


2, 4, 8, 11-13, 15 


99.2 


8 


1,2, 4, 8, 11-13, 15 


99.5 


9 


1-4, 8, 11-13, 16 


99.5 


10 


1,3,4-7, 11-14 


99.7 


11 


1-8, 11-13 


99.5 


12 


1-9, 11-13 


99.5 


13 


1-13 


99.5 


14 


1-13, 15 


99.5 


15 


1-15 


99.2 


16 


1-16 


99.2 



4. Conclusions 

A simple, active near-infrared prototype imaging sensor has been presented to show the feasibility 
of using a sparse detector array for capturing and subsequently classifying naive images or silhouettes 
of objects. The sensor appears to be a low-cost approach for discriminating among humans and non- 
humans in unattended ground deployments. Such an approach to designing an imaging sensor may be 
critical for wide-scale deployments in which the sensor is considered disposable; therefore, the cost of 
the sensor is crucial. An initial set of algorithms were evaluated using 512 datasets of humans, animals. 
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and vehicles with respect to their ability to classify the data acquired from the sensor into human 
versus non-human classes. Empirical analysis using these datasets shows that over ninety-nine percent 
(99%) accuracy is feasible in categorizing individually sensed objects into specific classes of interest. 
Although not included as part of the results summarized in this paper, the sensor and associated 
algorithms appear to be resilient to false alarms induced from non-human signatures; however, more 
detailed analysis is required to determine 'false alarm' probabilities. 

In addition, the sensor's low-level device driver software has been designed so that various 
classification algorithms can be inserted and objects can be classified in real-time. The software API 
has been wrapped to facilitate integration into a service-oriented architecture by providing a WSDL 
interface to the high-level sensor functionality. 

Future directions are being pursued along a variety of focus areas, including acquiring more 
signature data from the prototype sensor for a wider variety of objects typically found in open range. 
Also, a more detailed algorithm analysis that relates the resolution of the sensor to its object 
discrimination capabilities is needed to determine the optimal number of detectors and their 
arrangement within the sensor system for a given classification task. 

In many applications of the sensor, it will be imperative to execute the detection and classification 
algorithms using a light-weight, low-cost computing platform embedded within the sensor. Minimizing 
network bandwidth by transmitting only high-level classification results, as opposed to raw or 
compressed data or silhouettes is desirable in many potential applications. However, other applications 
are anticipated in which a human will be in the loop to directly classify the sensed data; therefore, 
human perception studies to evaluate the efficacy of manual classification of the naive images or 
silhouettes is also warranted. 

A power consumption analysis and optimization effort is needed prior to deployment, as well as 
developing techniques for camouflage or covert placement. Development of a second version of the 
prototype sensor, which will be more durable to support further field testing in typical application 
scenarios, is planned. Ultimately, the realization of the prototype sparse detector sensor concept as a 
'sensor on a stick' with a self-contained power supply and an embedded system for light-weight 
computation and communication capability within a network-centric, service-oriented environment is a 
worthwhile goal. 

Acknowledgements 

The anonymous peer reviewers are thanked for their suggestions that improved this paper. Funding 
for this work was provided in part by cooperative agreement W911NF-05-2-0019 between the 
University of Memphis and the U.S. Army's Research Laboratory (ARL), as well as funds from the 
Herff College of Engineering at the University of Memphis. This paper does not necessarily represent 
the position of the U.S. Government. Thanks to Kenny Emmanuel, Jeremy Brown, Yury Tritenko, 
Matthew Smith, Joseph Quails, and Irene Qiu for their assistance with the signature collection effort 
and software development support. 



Sensors 2008, 8 



8014 



References and Notes 

1. Russomanno, D.J.; Yeasin, M.; Jacobs, E.; Smith, M.; Sorower, M.S. Sparse Detector Sensor: 
Profiling Experiments for Broad-Scale Classification. In Proceedings SPIE-Defense and Security 
Symposium: Unattended Ground, Sea, and Air Sensor Technologies and Applications X 2008; 
volume 6963, pp. 69630M-69630M-11. 

2. Yeasin, M.; Russomanno, D.J.; Smith, M.; Sorower, M.S.; Shaik, J. Robust Classification of 
Objects from a Sparse Detector Sensor. In Proceedings of the International Conference on 
Machine Learning; Models, Technologies and Applications 2008, pp. 742-748. 

3. Russomanno, D.J.; Tritenko, Y.; Qiu, Q. A Web Service Interface for an Unattended Ground 
Sparse Sensor Detector. In Proceedings of the International Conference on Semantic Web and 
Web Services 2008, pp. 204-209. 

4. Sartain, R.B. Profiling Sensor for ISR Applications. In Proceedings SPIE-Defense and Security 
Symposium: Unattended Ground, Sea, and Air Sensor Technologies and Applications X 2008; 
volume 6963, 69630Q. 

5. Klett, K.K.; Sartain, R.; Alexander, T.; Aliberti, K. Optical and Radiometry Analysis for a Passive 
Infrared Sparse Sensor Detection System. In Proceedings SPIE-Infrared Imaging Systems: 
Design, Analysis, Modeling, and Testing XIX 2008; volume 6941, 694101. 

6. Dizard W. DHS Unveils Massive, Fast-Track Border Project. Government Computer News 2006, 
January 26. 

7. Strohm C. Border Tech Program is Plagued by Early Setbacks. Government Executive 2007, June. 

8. Robinson, A.L.; Halford, C.E.; Perry, E.; Wyatt, T. Sparse Detector Sensor Model. In 
Proceedings SPIE-Defense and Security Symposium: Unattended Ground, Sea, and Air Sensor 
Technologies and Applications X 200S; volume 6963, 69630L. 

9. Driggers, R.G.; Cox, P.; Edwards, T. In Introduction to Infrared and Electro-Optical Systems, 
Artech House: Norwood, MA, USA, 1999; pp. 189-227. 

10. Anonymous. CX-RVM5/D100/ND300R Data Sheet. Ramco Innovations 2007, Ramco: W. Des 
Moines, lA, USA. 

11. Schroeder, W.; Avila, L.S.; Hoffman, H. Visualizing with VTK: A Tutorial. IEEE Comput. 
Graph. Appl 2000, 20, 20-27. 

12. Singh, M.P.; Huhns, M.N. In Service-Oriented Computing; John Wiley and Sons: West Sussex, 
UK, 2005; pp. 19-42. 

13. Martinez, W.L.; Martinez, A.R. In Computational Statistics Handbook with Matlab; CRC press: 
Boca Raton, FL, USA 2002; pp. 319-331. 

14. Duda, D.O.; Hart, P.E. In Pattern Classification; John Wiley and Sons: New York, NY, USA, 
2001; pp. 117-121. 

15. Fukunaga, K. In Introduction to Statistical Pattern Recognition; Academic Press: San Diego, CA, 
USA, 1990; pp. 442-460. 

16. Hastie, T; Tibshirani, R.; Friedman, J. In The Elements of Statistical Learning; Springer- Verlag: 
New York, NY, USA, 2001, pp. 11-16. 

17. Seo, S.; Obermayer, K. Soft Learning Vector Quantization. Neural Comput 2003, 75, 1589-1604. 



Sensors 2008, 8 



8015 



18. Theodoridis, S.; Koutroumbas, K. In Pattern Recognition; Academic Press: San Diego, CA, USA, 
2007; pp. 96-104. 

19. Umer, M.F.; Khiyal, S.H. Classification of Textual Documents Using Linear Vector Quantization. 

Inf. Technol J. 2007, 6, 154-159. 

© 2008 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. 
This article is an open-access article distributed under the terms and conditions of the Creative 
Commons Attribution license (http://creativecommons.Org/licenses/by/3.0/). 



